A guide to best practices for Gene Ontology (GO) manual annotation
نویسندگان
چکیده
The Gene Ontology Consortium (GOC) is a community-based bioinformatics project that classifies gene product function through the use of structured controlled vocabularies. A fundamental application of the Gene Ontology (GO) is in the creation of gene product annotations, evidence-based associations between GO definitions and experimental or sequence-based analysis. Currently, the GOC disseminates 126 million annotations covering >374,000 species including all the kingdoms of life. This number includes two classes of GO annotations: those created manually by experienced biocurators reviewing the literature or by examination of biological data (1.1 million annotations covering 2226 species) and those generated computationally via automated methods. As manual annotations are often used to propagate functional predictions between related proteins within and between genomes, it is critical to provide accurate consistent manual annotations. Toward this goal, we present here the conventions defined by the GOC for the creation of manual annotation. This guide represents the best practices for manual annotation as established by the GOC project over the past 12 years. We hope this guide will encourage research communities to annotate gene products of their interest to enhance the corpus of GO annotations available to all. DATABASE URL: http://www.geneontology.org.
منابع مشابه
Gene Ontology Evidence Sentence Extraction and Concept Extraction: Two Rule-Based Approaches
Gene Ontology (GO) annotation have been relying on human annotation to capture accurate description of the published full-length literature. Though manual annotation may provide promising quality of the task. However, it is labour-intensive and time-consuming. In turn, we developed two different methods: a sequential pattern mining algorithm and GREPC (Geneontology concept Recognitionby Entity,...
متن کاملManual of GO-function
The GO-function package is an enrichment analysis tool for Gene Ontology (GO) [1]. According to several explicit rules, it is designed for treating the redundancy resulting from the GO structure or multiple annotation genes. Different from current redundancy treatment tools [2, 3, 4, 5] simply based on some numerical considerations, GO-function can find terms which are both statistically interp...
متن کاملGene Ontology: Pitfalls, Biases, Remedies
The Gene Ontology (GO) is a formidable resource but there are several considerations about it that are essential to understand the data and interpret it correctly. The GO is sufficiently simple that it can be used without deep understanding of its structure or how it is developed, which is both a strength and a weakness. In this chapter, we discuss some common misinterpretations of the ontology...
متن کاملThe GOA database in 2009—an integrated Gene Ontology Annotation resource
The Gene Ontology Annotation (GOA) project at the EBI (http://www.ebi.ac.uk/goa) provides high-quality electronic and manual associations (annotations) of Gene Ontology (GO) terms to UniProt Knowledgebase (UniProtKB) entries. Annotations created by the project are collated with annotations from external databases to provide an extensive, publicly available GO annotation resource. Currently cove...
متن کاملThe GOA database: Gene Ontology annotation updates for 2015
The Gene Ontology Annotation (GOA) resource (http://www.ebi.ac.uk/GOA) provides evidence-based Gene Ontology (GO) annotations to proteins in the UniProt Knowledgebase (UniProtKB). Manual annotations provided by UniProt curators are supplemented by manual and automatic annotations from model organism databases and specialist annotation groups. GOA currently supplies 368 million GO annotations to...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 2013 شماره
صفحات -
تاریخ انتشار 2013